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a b c d e f 



1 - - g 

H : Glu G1 Y Val Arg Ser Gly Asn Asn Leu Thr 

Vc--"' Va * Thr Ser Ser Gly 

-16 Gin Asp He Ser Lys Lys 

22 Tyr Ala Asp Glu Val Glu Ser 

23 His Leu Glu Ser He 
3 4 Leu Lys Asp Val Lys Lys Asn 
41 Leu Lys Lys 

44 Val Gin His Thr Gin Asn Val 

51 Gly Leu He Thr Lys 

56 Leu Ser Glu He Lys Lys Lys 

l\ Tyr 
64 Leu Tyr Asp Leu Lys 
69 Val Asn Val Leu Ser Glu Ala 
76 Glu Leu Thr Ser Lys 

81 Thr Lys Glu Thr 

85 Lys Glu Lys Leu Thr Ala Thr 
92 Phe Glu Gin Phe Lys Lys Asp 

Thr Leu Pro Thr Glu Pro 

iJb Glu Lys Lys 

108 Val Ala Glu Ala Gin Lys Lys 
115 Val Glu Glu Ala Lys Lys Lys 
Ala Glu Asp Gin 
126 Lys Glu Lys Asp Arg Arg Asn 
133 Tyr Pro Thr He Thr 
138 Tyr Lys Thr Leu Glu Leu Glu 
145 He Ala Glu Ser Asp Val Glu 
152 Val Lys Lys Ala Glu Leu Glu 
159 Leu Val Lys Val Lys Ala Lys 
166 Glu Ser Gin Asp Glu Glu Lys 
173 He Lys Gin Ala Glu Ala Glu 
180 Val Glu Ser Lys Gin Ala Glu 
187 Ala Thr Arg 

190 Leu Lys Lys He Lys Thr Asp 
197 Arg Glu Glu Ala' Lys Arg Lys 

2 04 Ala Asp Ala Lys Leu Lys 
210 Glu Ala Val Glu Lys Asn 
216 Val Ala Thr Ser Glu Gin Asp 
223 Lys 

22 4 

Pro Lys Arg Arg Ala Lys Arg Gly Val Ser 
Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 
Asn Asp Ala Lys Ser Ser Asp Ser Ser Val 
Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu 
Asn Met Ala Asn 

Glu Ser Gin 



244 
254 
264 
268 

271 Thr Glu His Arg Lys Asp 

277 Val Asp Glu Tyr 

2 81 He Lys Lys Met Leu Ser Glu 

2 38 He Gin Leu Asp Arg Arg Lys 
295 " His Thr Gin Asn Val 

300 Asn Leu Asn He Lys 

305 Leu Ser Ala He Lys Thr Lys 
312 Tyr Leu Tyr Glu 

316 Leu Ser Val Leu Lys Glu Asn 

Ser Lys 

325 Lys Glu Glu Leu Thr Ser Lys F*l(3 T>d 

"2 Thr Lys Ala clu ^VW*?^ 

33 6 Leu Thr Ala Ala Phe Glu Gin 
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,3 43 Phe Lys Lys 

346 Asp Thr Leu Lys Pro 

351 Glu Lys Lys 

354 Val Ala Glu Ala Glu Lys Lys 

3 61 Val Glu Glu Ala Lys Lys Lys 

368 Ala Lys Asp Gin 

372 Lys Glu Glu Asp Arg Arg Asn 

379 Tyr 

380 Pro Thr Asn Thr 
3 84 Tyr Lys Thr Leu Glu Leu Glu 
3 91 lie Ala Glu Ser Asp Val Lys 
3 98 Val Lys Glu Ala Glu 

403 Leu Glu Leu Val Lys Glu Glu 

410 Ala Asn Glu Ser Arg Asn Glu 

417 Glu Lys lie Lys Gin Ala 

423 Lys Glu Lys Val Glu Ser Lys 

430 Lys Ala Glu Ala Thr Arg 

436 Leu Glu Lys lie Lys Thr Asp 

443 Arg Lys Lys Ala Glu Glu Glu 

450 Ala Lys Arg Lys 

454 Ala Glu Glu Ser Glu Lys Lys 

461 Ala Ala Glu Ala Lys Gin Lys 

468 Val Asp Ala Glu Glu Tyr Ala 

475 Leu Glu Ala Lys 

47 9 lie Ala Glu Leu Glu Tyr Glu 

486 Val Gin Arg Leu Glu Lys Glu 

493 Leu Lys Glu 

496 lie Asp Glu Ser Asp Ser Glu 

503 Asp Tyr Leu Lys Glu Gly 

509 Leu Arg Ala 

512 Pro Leu Gin Ser Lys 

517 Leu Asp Thr Lys Lys Ala Lys 

524 Leu Ser Lys 

527 Leu Glu Glu Leu Ser Asp Lys 

534 lie Asp Glu Leu Asp Ala Glu 

5 41 lie Ala Lys Leu Glu Val Gin 

548 Leu Lys Asp Ala Glu Gly Asn 

555 Asn Asn 

557 Val Glu Ala Tyr Phe Lys Glu 

5 64 Gly Leu Glu Lys Thr 

5 69 Thr Ala Glu Lys Lys 

574 Ala Glu Leu Glu Lys Ala 

580 Glu Ala Asp Leu Lys Lys Ala 

5 87 Val Asp Glu 

The coiled-coil motif of the a-helix of PspC. Amino acids that are not in the 
coiled-coil motif are in the right column. This is the output from the Matcher program 
accessed through the Internet (http://catt.poly.edu/-jps/). 



FIG.3B 
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Clade B 



DBL6A 



BG8090 



D39- lineage 

I spsA47 

I spsA2 l 

I ^ 
I cbpA 

i c3bp D39 



L81905 




spsA1 




BG9163 



0.05 changes 



F/G.4 
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Choline-binding Domain 



PspC consensus 

11111111112 

1?.34567 °Q"i?^^56 7890 

1 KTGWKQENGNWYFYNTDGSMA 

2 TGWLQNNGSWYYLNAMGAMA 
TGWLQNNGSWYYLNANGSMA 
TGWL.QNNGSWYYLNANGAMA 
TGWLQYNGSWYYLNANGDMA 
TGWLQYNGSWYYLNSNGAMA 
TGWLQYNG SWYYLN ANGDMA 
TGWIiQNNGSWYYLNANGDMA 
TGWLQYN SWYYLNANGDMA 

10 TGWVKDGDTWYYLEASGAMKA 

11 SQVJFKVSDKWYYVNGSGALA 
VNTTVDGYGVNANGEWVN 



3 

4 

5 

6 

7 

8 

9 



Y 



18 amino acid tail PspC 
28% identical to consensus 



PspC consensus repeat 

Yother and Briles 1992. 



For PspA/R36A alingement see 

TGWLQXNGSWYYLNANGAMA *!f ^^^PapC consensus 



VNTTVDGYKVNANGEWV_ 



PSPA/Rxl 17 amino acid tail 
0ne AA different from PspC, 
And one AA shorter. 



FIG.5A 
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PspA/EF5668 



11111111112 



12345678901234567890 

1 IGWKQENGMWYFYNTDGSM A 

2 TGWLQNNGSWYYLNSNGAM A 

3 TGWLQYNGSWYYLNANGAM A 

4 TGWLQYNG S WY YLNANGAM A 

5 TGWLQYNGSWYYLNANGDM A 

6 TGWLQYNG SWYYLNANGDM A 

7 TGWAKVHGSWYYLNANGSM A 

8 TGWVKDGETWYYLEASGSMKA 

9 NQWFQVSDKWYYVNGLGSL S 

10 VNTTVDGYKVNANGEWV 17 amino acid tail EF5668 PspA 
TGWLQYNGSWYYLNANGSM A PspA/EF5668 consensus 



90% identical to PspC consensus- 



Repeat #1 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



KTGWKQENGNWYFYNTDGSMA 
TGWKQENGMWYFYNTDGSMA 
IGWKQENGMWYFYNTDGSM A 



Repeat #N-1 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



TGWVKDGDTWY YLE A S G AMKA 
TGWVKDGDTWYYLEASGAMKA 
TGWVKDGETWYYLEASGSMKA 



Repeat #N 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



SQWFKVSDKWYYVNGSGALA 
SQWFKVSDKWYYVNGLGALA 
NQWFQVSDKWYYVNGLGSLS 



17-18 AA tail 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



VNTTVDGYGVNANGEWVN 

VNTTVDGYKVNANGEWV 

VNTTVDGYKVNANGEWV 



FIG.5B 
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Western Immunoblot of pneumococcal lyates 
Panel A was developed with anti-PspC polyclonal serum and panel B 
was developed with anti-PspA monocional antibody, Xi126. The 
molecular weight markers are indicated. Cross-reaction of the polyclonal 
serum to PspC is observed with all strains tested. 
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bhh Immunized 

r — i Non-immunized 



LXS240 
PspC 



Em en 



UA103 UA13 UA55 
PspA 1-370 PspA 1-315 PspA 1-305 



Antigen 

Level of antibody reactive to PspC and PspA fragments 
present in the sera of mice immunized with PspC. Each bar represents 
the mean of the log reciprocal titer and upperbound of standard error 
of sera from five mice. The limit of detection of the log reciprocal antibody 
titer is 1.8. 



F/G.7 
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Other Formats: FIG. 8 A 

Links: 

LOCUS CAA05158 539 aa BCT 14-OCT-1997 

DEFINITION SpsA protein. 
ACCESSION CAA05158 
PID g2576331 

VERSION CAA05 158.1 GI:2576331 

DBSOURCE embl locus SPSPS A2, accession AJ002054. 1 

KEYWORDS 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Baciilus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (residues 1 to 539) 
AUTHORS HammerschmidtS. 
TITLE Direct Submission 

JOURNAL Submitted (13-OCT-1997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (residues 1 to 539) 
AUTHORS Hanunerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1 1 13-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..539 

/organism^' Streptococcus pneumoniae" 
/strain="type 2" 
/db_xref="taxon: 1 3 1 3" 
Protein 1..539 

/function="IgA binding protein" 
/product="SpsA protein" 
CDS 1..539 

/db_xref="SPTREMBL:033741 " 
/coded_by=" AJ002054. 1 : 1 .. 1 620" 
/transl_table=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslv mgswhaten egstqaatss nmaktehrka 
61 akqwdeyie kmlreiqldr rkhtqnvaln iklsaiktky lrelnvleek skdelpseik 
121 akldaafekf kkdtlkpgek vaeakkkvee akkkaedqke edrmyptnt yktleleiae 
181 fdvkvkeael elvkeeakef rnegtikqak ekveskkaea trleniktdr kkaeeeakrk 
241 aaeedkvkek paeqpqpapa tqpekpapkp ekpaeqpkae ktddqqaeed yarrseeeyn 
301 rltqqqppkt ekpaqpstpk tgwkqengmw yfyntdgsma tgwlqnngsw yylnangama 
361 tgwlqnngsw yylnangsma tgwlqnngsw yylnangama tgwlqyngsw yylnsngama 
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F/G.8B 

421 tgwlqyngsw yylnangdma tgwlqnngsw yylnangdma tgwlqyngsw yylnangdma 
481 tgwvkdgdtw yyletsgamk asqwfkvsdk wyyvhgssal ainttvygyg vnangewvn 



the above report in format 
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F/G.8C 

LOCUS SPSPSA2 1620 bp DNA BCT 14-OCT-1997 

DEFINITION Streptococcus pneumoniae SIgA binding. 
ACCESSION AJ002054 
NID g2576330 
VERSION AJ002054.1 GI:2576330 
KEYWORDS SpsA protein. 
SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Bacillus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (bases 1 to 1620) 
AUTHORS Hammerschmidt,S. 
TITLE Direct Submission 

JOURNAL Submitted (13-OCT-1997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 3 8 1 06 Braunschweig, GERMANY 
REFERENCE 2 (bases 1 to 1620) 
AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1 1 13-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..1620 

/organism-' Streptococcus pneumoniae" 
/strain="type 2" 
/db_xref="taxon:1313" 
CDS 1..1620 

/function-'IgA binding protein" 
/codon_start=l 
/transl_table=ll 
/product="SpsA protein" 
/protein_id="C AA05 158.1" 
/db_xref="PID:e354783" 
/db_xref='TID:g257633 1" 
/db_xref="GI:2576331" 
/db_xref="SPTREMBL:033741" 

/translation="MFASKSERKVHYSIRKFSIGVASVAVASLVMGSWHATENEGST 



QAATSSNMAKTCHRKAAKQWDEYIEKMLREIQLDRRKHTQNVALNIKLSAIKT 
KYLR 



16/50 FIG. 8D 



ELNVLEEKSKDELPSEIKAKLDAAFEK^ 
DQKE 

EDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKEFRNEGTIKQAK^ 
KKA 

EATRLENIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQP 
PAEQ 

PKAEKTDDQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGM 
WYFYN 

TDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANGSMATGW 
LQNNGSWYY 

LNANGAMATG\^QYNGSWYYLNSNGAMATGWLQYNGSWYYLNANGDMAT 
GWLQNNGSW 

YYLNANGDMATGWLQYNGSWYYLNANGDMATGWKDGDTWYYLETSGAM 

KASQWFKVS 

DKWYYVHGSSALAINTTVYGYGVNANGEWVN" 

repeatjregion 961.-1 564 

/rpt_type=DIRECT 
BASE COUNT 625 a 294 c 370 g 331 t 
ORIGIN 

1 atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 
61 gtagctagtg tagctgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 
121 gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 
181 gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 
241 agaaaacata cccaaaatgt cgccttaaac ataaagttga gcgcaattaa aacgaagtat 
301 ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 
361 gcaaagttag acgcagcttt tgagaagttt aaaaaagata cattgaaacc aggagaaaag 
421 gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 
481 gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 
541 ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaattt 
601 cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggct 
661 acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 
721 gcagcagaag aagataaagt taaagaaaaa ccagctgaac aaccacaacc agcgccggct 
781 actcaaccag aaaaaccagc tccaaaacca gagaagccag ctgaacaacc aaaagcagaa 
841 aaaacagatg atcaacaagc tgaagaagac tatgctcgta gatcagaaga agaatataat 
901 cgcttgactc aacagcaacc gccaaaaact gaaaaaccag cacaaccatc tactccaaaa 
961 acaggctgga aacaagaaaa cggtatgtgg tacttctaca atactgatgg ttcaatggca 
1021 acaggatggc tccaaaacaa cggttcatgg tactatctaa acgctaatgg tgctatggcg 
1081 acaggatggc tccaaaacaa tggttcatgg tactatctaa acgctaatgg ttcaatggca 
1141 acaggatggc tccaaaacaa tggttcatgg tactacctaa acgctaatgg tgctatggcg 
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FI6.8E 

1201 acaggatggc tccaatacaa tggttcatgg tactacctaa acagcaatgg cgctatggcg 
1261 acaggatggc tccaatacaa tggctcatgg tactacctca acgctaatgg tgatatggcg 
1321 acaggatggc tccaaaacaa cggttcatgg tactacctca acgctaatgg tgatatggcg 
1381 acaggatggc tccaatacaa cggttcatgg tattacctca acgctaatgg tgatatggcg 
1441 acaggttggg tgaaagatgg agatacctgg tactatcttg aaacatcagg tgctatgaaa 
1501 gcaagccaat ggttcaaagt atcagataaa tggtactatg tccatggctc aagtgccctt 
1561 gcaatcaaca caactgtata tggctatgga gtcaatgcca atggtgaatg ggtaaactaa 



the above report in format 



Other Formats: 
Links: 
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FIG. 9 A 



LOCUS CAA05159 581 aa BCT 14-OCT-1997 

DEFINITION SpsA protein. 
ACCESSION CAA05159 
PID g2576333 

VERSION CAA05159.1 GI:2576333 

DBSOURCE embl locus SPSPS A47, accession AJ002055 . 1 

KEYWORDS 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Baciilus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (residues 1 to 581) 
AUTHORS Hammerschmidt,S. 
TITLE Direct Submission 

JOURNAL Submitted (1 3-OCT-l 997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (residues 1 to 581) 
AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1 1 13-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..581 

/organism-' S treptococcus pneumoniae" 
/strain="type 47" 
/db_xref="taxon:1313" 
Protein 1..581 

/function- 'IgA binding protein" 
/product-' SpsA protein" 
CDS 1..581 

/db_xref="SPTREMBL:033742" 
/coded_by=" AJ002055. 1 : 1 1746" 
/transl_table=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslv mgswhaten egstqaatss nmaktehrka 
61 akqwdeyie kmlreiqldr rkhtqnvaln iklsaiktky lrelnvleek skdelpseik 
121 akldaafekf kkdtlkpgek vaeakkkvee akkkaedqke edrrnyptnt yktleleiae 
181 fdvkvkeael elvkeeakes megtikqak ekveskkaea trleniktdr kkaeeeakrk 
241 adaklkeanv atsdqgkpkg rakrgvpgel atpdkkenda kssdssvgee tlpssslksg 
301 kkvaeaekkv eeaekkakdq keedrrnypt ntyktldlei aesdvkvkea elelvkeeak 
361 eprdeekikq akakveskka eatrlenikt drkkaeeeak rkaaeedkvk ekpaeqpqpa 
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421 patqpekpap kpekpaeqpk aektddqqae edyarrseee ynrliqqqpp ktekpaqpft 

481 pktgwkqeng mwyfyntdgs matgwlqyng swyylnangd matgwvkdgd twyyleasga 

541 mkasqwfkvs dkwyyvngsg alavnttvdg ygvnangewv n 



FJG.9B 

the above report in format 
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FIG.9C 

Other Foraiats: 
Links: 

LOCUS SPSPSA47 1746 bp DNA BCT 14-OCT-1997 

DEFINITION Streptococcus pneumoniae SIgA binding. 
ACCESSION AJ002055 
N1D g2576332 
VERSION AJ002055.1 GL2576332 
KEYWORDS SpsA protein. 
SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Baciilus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (bases 1 to 1746) 
AUTHORS Hammerschmidt,S. 
TITLE Direct Submission 

JOURNAL Submitted (13-OCT-l 997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (bases 1 to 1746) 
AUTHORS Hammerschmidt,S., Talay.S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TTTLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1 1 13-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..1746 

/organism-' Streptococcus pneumoniae" 
/strain="type 47" 
/db_xref='"taxon: 1 3 1 3" 
CDS 1..1746 

/function- 1 IgA binding protein" 

/codon_start=l 

/transl_table=l 1 

/product="SpsA protein" 

/protein_id="C AA05 1 59. 1 " 

/db_xref="PID:e354769" 

/db_xref="PID:g2576333" 

/db_xref="GI:2576333" 

/db_xref="SPTREMBL:033742" 

/translation="MFASKSERKVHYSIRKFSIGVASVAVASLVMGSWHATENEGST 

QAATSSNMAKTEHRKAAKQWDEYIEKMLREIQLDRRKHTQNVALNIKLSAIKT 
KYLR 
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FI6.9D 

v ;. ELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAE 
'PQKE 

EDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKESRNEGTIKQAKEKVES 
KKA 

EATRLENIKTDRKKAEEEAKRKADAKLKEANVATSDQGKPKGRAKRGVPGELA 
TPDKK 

END AKS SDSS VGEETLP SSSLKSGKKV AEAEKKVEE AEKKAKD QKEEDRRN YPT 
NTYK 

TLDLEIAESDVKVKEAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIK 
TDR 

KKAEEEAKRKAAEEDKVKEKPAEQPQPAPATQPEKPAPKPEKPAEQPKAEKTDD 
QQAE 

EDYARRSEEEYNRLIQQQPPKTEKPAQPFTPKTGWKQENGMWYFYNTDGSMAT 
GWLQY 

NGSWYYLNANGDMATGWVKDGDTWYYLEASGAMBCASQWFKVSDKWYYVN 
GSGALAVNT 

TVDGYGVNANGEWVN" 
repeatjregion 1 447 . . 1 690 

/rpt_type=DIRECT 
BASE COUNT 722 a 290 c 408 g 326 1 
ORIGIN 

1 atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 
61 gtagctagtg tagctgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 
121 gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 
181 gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 
241 agaaaacata cccaaaatgt cgccttaaacataaagttga gcgcaattaa aacgaagtat 
301 ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 
361 gcaaagttag acgcagcttt tgagaagitt aaaaaagata cattgaaacc aggagaaaag 
421 gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 
481 gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 
541 ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaatct 
601 cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggct 
661 acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 
721 gcagatgcta agttgaagga agctaatgta gcgacttcag atcaaggtaa accaaagggg 
781 cgggcaaaac gaggagttcc tggagagcta gcaacacctg ataaaaaaga aaatgatgcg 
841 aagtcttcag attctagcgt aggtgaagaa actcttccaa gctcatccct gaaatcagga 
901 aaaaaggtag cagaagctga gaagaaggtt gaagaagctg agaaaaaagc caaggatcaa 
961 aaagaagaag atcgccgtaa ttacccaacc aatacttaca aaacgcttga ccttgaaatt 
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1021 gctgagtccg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaag 
1081 gaacctcgag acgaggaaaa aattaagcaa gcaaaagcga aagttgagag taaaaaagct 
1 141 gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 
1201 cgaaaagcag cagaagaaga taaagttaaa gaaaaaccag ctgaacaacc acaaccagcg 
1261 ccggctactc aaccagaaaa accagctcca aaaccagaga agccagctga acaaccaaaa 
1321 gcagaaaaaa cagatgatca acaagctgaa gaagactatg ctcgtagatc agaagaagaa 
1381 tataatcgct tgattcaaca gcaaccgcca aaaactgaaa aaccagcaca accatttact 
1441 ccaaaaacag gctggaaaca agaaaacggt atgtggtact tctacaatac tgatggttca 
1501 atggcaacag gatggctcca atacaacggt tcatggtatt acctcaacgc taatggtgat 
1561 atggcgacag gttgggtgaa agatggagat acctggtact atcttgaagc atcaggtgct 
1621 atgaaagcaa gccaatggtt caaagtatca gataaatggt actatgtcaa tggctcaggt 
1681 gcccttgcag tcaacacaac tgtagatggc tatggagtca atgccaatgg tgaatgggta 
1741 aactaa 



FIG. 9E 

the above report in format 
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LOCUS AAB70838 663 aa BCT 16-SEP-1997 

DEFINITION choline binding protein A. 
ACCESSION AAB70838 
PID g2425109 

VERSION AAB70838.1 GI:2425109 
DBSOURCE locus AF019904 accession AF019904.1 
KEYWORDS 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 
Streptococcaceae; Streptococcus. 
REFERENCE 1 (residues 1 to 663) 
AUTHORS Rosenow,C, Ryan^*., WeiserJ.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TrTLE Contribution of novel choline-binding proteins to adherence, 
colonization and immunogenicity of Streptococcus pneumoniae 
JOURNAL Mol. Microbiol. (1997) In press 
REFERENCE 2 (residues 1 to 663) 
AUTHORS Rosenow,C, Ryan,P., WeiserJ.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Direct Submission 

JOURNAL Submitted (1 8-AUG- 1 997) Infectious Diseases, St. Jude Children's 

Research Hospital, 322 North Lauderdale, Memphis, TN 38105, USA 
COMMENT Method: conceptual translation supplied by author. 
FEATURES Location/Qualifiers 
source 1 ..663 

/organism— ' Streptococcus pneumoniae" 
/strain="R6x" 
/db_xref="taxon:1313" 
Protein <1..663 

/product=" choline binding protein A" 
/name="CbpA" 
CDS 1..663 

/gene="cbpA" 

/coded_by=" AF019904. 1 :<1 ..1 992" 
/transl_table=ll 

ORIGIN 

1 enegstqaat ssnmaktehr kaakqwdey iekmlreiql drrkhtqnva lniklsaikt 
61 kylrelnvle ekskdelpse ikakldaafe kfkkdtlkpg ekvaeakkkv eeakkkaedq 
121 keedrrnypt ntyktlelei aefdvkvkea elelvkeeak esrnegtikq akekveskka 
181 eatrlenikt drkkaeeeak rkadaklkea nvatsdqgkp kgrakrgvpg elatpdkken 
241 dakssdssvg eetlpssslk sgkkvaeaek kveeaekkak dqkeedrrny ptntyktldl 
301 eiaesdvkvk eaelelvkee akeprdeeki kqakakvesk kaeatrleni ktdrkkaeee 
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361 akrkaaeedk vkekpaeqpq papatqpekp apkpekpaeq pkaektddqq aeedyarrse 
421 eeynrltqqq ppktekpaqp stpktgwkqe ngmwyfyntd gsmatgwlqn ngswyylnan 
481 gamatgwiqn ngswyylnan gsmatgwlqn ngswyylnan gamatgwlqy ngswyylnsn 
541 gamatgwlqy ngswyylnan gdmatgwlqn ngswyylnan gdmatgwlqy ngswyylnan 
601 gdmatgwvkd gdtwyyleas gamkasqwfk vsdkwyyvng sgalavnttv dgygvnange 
661 wvn 



the above report in format 



Othfer Formats: 
Links: 



25/50 F/G./OC 



LOCUS AF019904 2480 bp DNA BCT 22-SEP-1997 

DEFINITION Streptococcus pneumoniae choline binding protein A (cbpA) gene, 

partial cds. 
ACCESSION AFO 19904 
NID g2425108 
VERSION AF01 9904.1 GI:2425108 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 
Streptococcaceae; Streptococcus. 
REFERENCE 1 (bases 1 to 2480) 
AUTHORS Rosenow,C, Ryan,P., Weiser,J.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Contribution of novel choline-binding proteins to adherence, 
colonization and immunogenicity of Streptococcus pneumoniae 
JOURNAL Mol. Microbiol. (1997) In press 
REFERENCE 2 (bases 1 to 2480) 
AUTHORS Rosenow,C, Ryan,P., Weiser,J.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Direct Submission 

JOURNAL Submitted (1 8-AUG- 1 997) Infectious Diseases, St. Jude Children's 

Research Hospital, 322 North Lauderdale, Memphis, TN 38105, USA 
FEATURES Location/Qualifiers 
source 1 ..2480 

/organism— ' Streptococcus pneumoniae" 
/strain="R6x" 
/db_xref="taxon: 13 13" 
gene <1..1992 

/gene="cbpA" 
CDS <1..1992 
/gene="cbpA" 
/note="CbpA" 
/codon_start=l 
/transl_table=ll 

/product="choline binding protein A" 
/protein_id=" AAB70838. 1 " 
/db_xref="PID:g2425 1 09" 
/db xref="GI:2425109" 



/translation= 

"ENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRK 
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htqnValniklsaiktkylrelnvleekskdelpseikakldaafekfkiodtlkp 

GEK 

VAEAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELV 
KEEAK 

ESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKRKADAKLKEANVAT 
SDQG 

KPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETLPSSSLKSGKKVAEAEKKV 
EEAE 

KKAKDQKEEDRRNYPTNTYKTLDLEIAESDVKVKEAELELVKEEAKEPPvDEEKI 
KQAK 

AKVESKKAEATRLENIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPATQPE 
KPAP 

KPEKPAEQPKAEKTDDQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG 
WKQE 

NGMWYFYNTDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNA 
NGSMATGWL 

QNNGSWYYLNANGAMATGWLQYNGSWYYLNSNGAMATGWLQYNGSWYYL 
NANGDMATG 

WLQNNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWVKDGDTWY 
YLEASGAMK 

ASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN" 
BASE COUNT 933 a 452 c 575 g 520 1 
ORIGIN 

1 gaaaacgaag gaagtaccca agcagccact tcttctaata tggcaaagac agaacatagg 
61 aaagctgcta aacaagtcgt cgatgaatat atagaaaaaa tgttgaggga gattcaacta 
121 gatagaagaa aacataccca aaatgtcgcc ttaaacataa agttgagcgc aattaaaacg 
181 aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aagatgagtt gccgtcagaa 
241 ataaaagcaa agttagacgc agcttttgag aagtttaaaa aagatacatt gaaaccagga 
301 gaaaaggtag cagaagctaa gaagaaggtt gaagaagcta agaaaaaagc cgaggatcaa 
361 aaagaagaag atcgtcgtaa ctacccaacc aatacttaca aaacgcttga acttgaaatt 
421 gctgagttcg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaaa 
481 gaatctcgaa acgagggcac aattaagcaa gcaaaagaga aagttgagag taaaaaagct 
541 gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 
601 cgaaaagcag atgctaagtt gaaggaagct aatgtagcga cttcagatca aggtaaacca 
661 aaggggcggg caaaacgagg agttcctgga gagctagcaa cacctgataa aaaagaaaat 
721 gatgcgaagt cttcagattc tagcgtaggt gaagaaactc ttccaagctc atccctgaaa 
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781 tcaggaaaaa aggtagcaga agctgagaag aaggttgaag aagctgagaa aaaagccaag 
841 gatcaaaaag aagaagatcg ccgtaactac ccaaccaata cttacaaaac gcttgacctt 
901 gaaattgctg agtccgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 
961 gctaaggaac ctcgagacga ggaaaaaatt aagcaagcaa aagcgaaagt tgagagtaaa 
1021 aaagctgagg ctacaaggtt agaaaacatc aagacagatc gtaaaaaagc agaagaagaa 
1081 gctaaacgaa aagcagcaga agaagataaa gttaaagaaa aaccagctga acaaccacaa 
1141 ccagcgccgg ctactcaacc agaaaaacca gctccaaaac cagagaagcc agctgaacaa 
1201 ccaaaagcag aaaaaacaga tgatcaacaa gctgaagaag actatgctcg tagatcagaa 
1261 gaagaatata atcgcttgac tcaacagcaa ccgccaaaaa ctgaaaaacc agcacaacca 
1321 tctactccaa aaacaggctg gaaacaagaa aacggtatgt ggtacttcta caatactgat 
1381 ggttcaatgg caacaggatg gctccaaaac aacggttcat ggtactatct aaacgctaat 
1441 ggtgctatgg cgacaggatg gctccaaaac aatggttcat ggtactatct aaacgctaat 
1501 ggttcaatgg caacaggatg gctccaaaac aatggttcat ggtactacct aaacgctaat 
1561 ggtgctatgg cgacaggatg gctccaatac aatggttcat ggtactacct aaacagcaat 
1621 ggcgctatgg cgacaggatg gctccaatac aatggctcat ggtactacct caacgctaat 
1681 ggtgatatgg cgacaggatg gctccaaaac aacggttcat ggtactacct caacgctaat 
1741 ggtgatatgg cgacaggatg gctccaatac aacggttcat ggtattacct caacgctaat 
1801 ggtgatatgg cgacaggttg ggtgaaagat ggagatacct ggtactatct tgaagcatca 
1861 ggtgctatga aagcaagcca atggttcaaa gtatcagata aatggtacta tgtcaatggc 
1921 tcaggtgccc ttgcagtcaa cacaactgta gatggctatg gagtcaatgc caatggtgaa 
1981 tgggtaaact aaacctaata taactagtta atactgactt cctgtaagaa ctttttaaag 
2041 tattccctac aaataccata tcctttcagt agataatata cccttgtagg aagtttagat 
2101 taaaaaataa ctctgtaatc tctagccgga tttatagcgc tagagactac ggagtttttt 
2161 tgatgaggaa agaatggcgg cattcaagag actctttaag agagttacgg gttttaaact 
2221 attaagcctt ctccaattgc aagaggcttc aatctctgct agggtgctag cttgcgaaat 
2281 ggctccacgg agtttggcag cgccagatgt tccacggaga tagtgaggag cgaggccgcg 
2341 gaattcacga actgcgacgt tttctccttt gaggttaatc aatcgtttca agtgttcgta 
2401 ggcgatcttc atcttgtctt caaaggtcaa atcaggtagg atttctcctg tttcaaagtt 
2461 tatggtggcc ctggttgaag 



the above report in format 
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Other Formats: 
Links: 

LOCUS AAD00184 929 aa BCT 07-OCT-1996 

DEFINITION surface protein C. 
ACCESSION AAD00184 
PID g4097980 

VERSION AAD00184.1 GL4097980 
DBSOURCE locus SPU72655 accession U72655.1 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 
Streptococcaceae; Streptococcus. 
REFERENCE 1 (residues 1 to 929) 
AUTHORS Brooks- Walter^., Tart,R.C, Briles,D.E. and Hollingshead,S.K. 
TITLE The pspC gene encodes a second pneumococcal surface protein 
homologous to the protection-eliciting PspA protein of 
Streptococcus pneumoniae 
JOURNAL Unpublished 
REFERENCE 2 (residues 1 to 929) 
AUTHORS Brooks- Walter,A., Tart,R.C, Briles,D.E. and Hollingshead,S.K. 
TITLE Direct Submission 

JOURNAL Submitted (26-SEP-l 996) Microbiology, University of Alabama at 
Birmingham, 19th Street South, Box 10, Birmingham, AL 35294-2170, 
USA 

FEATURES Location/Qualifiers 
source 1..929 

/organism-' Streptococcus pneumoniae" 

/strain="EF6796" 

/db_xref="taxon: 1313" 
Protein 1..929 

/product=" surface protein C" 
CDS 1..929 

/gene="pspC" 

/coded_by= ,, U72655.1:319..3108 ,, 
/transl_table=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslf Iggwhaegv rsgnnltvts sgqdiskkya 
61 deveshlesi lkdvkknlkk vqhtqnvgli tklseikkky iydlkvnvls eaeltsktke 
121 tkekltatfe qfkkdtlpte p'ekkvaeaqk kveeakkkae dqkekdrrny ptityktlel 
181 eiaesdvevk kaelelvkvk akesqdeeki kqaeaevesk qaeatrlkki ktdreeakrk 
241 adaklkeave knvatseqdk pkrrakrgvs gelatpdkke ndakssdssv geetlpspsl 
301 nmanesqteh rkdvdeyikk mlseiqldrr khtqnvnlni klsaiktkyl yelsvlkens 
361 kkeeltsktk aeltaafeqf kkdtlkpekk vaeaekkvee akkkakdqke edrrnyptnt 
421 yktleleiae sdvkvkeael elvkeeanes rneekikqak ekveskkaea trlekiktdr 
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481 kkaeeeakrk aeesekkaae akqkvdaeey aleakiaele yevqriekel keidesdsed 
541 ylkeglrapl qskldtkkak lskleelsdk ideldaeiak levqlkdaeg nnnveayfke 
601 glekttaekk aelekaeadl kkavdepetp apapqpapap ekpaekpapa pekpapapek 
661 papapekpap apekpapape kpaptpetpk tgwkqengmw yfyntdgsma tgwlqnngsw 
721 yyinsngama tgwlqnngsw yylnsngama tgwiqyngsw yylnangdma tgwlqyngsw 
781 yylnangdma tgwfqyngsw yylnangdma tgwfqyngsw yylnangdma tgwlqyngsw 
841 yylnsngamv tgwlqnngsw yylnangsma tdwvkdgdtw yyleasgamk asqwfkvsdk 
901 wyyvngsgal avnttvdsyr vnangewvn 



the above report in format 
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Other Formats: r—is* » 

Links: rIG. I IC 

LOCUS SPU72655 3463 bp DNA BCT 02-JAN-1999 

DEFINITION Streptococcus pneumoniae surface protein C (pspC) gene, complete 
cds. 

ACCESSION U72655 

NID g4097979 

VERSION U72655.1 GI:4097979 

KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 
Streptococcaceae; Streptococcus. 
REFERENCE 1 (bases 1 to 3463) 
iJ^H ORS Brooks- Walter,A., Tart,R.C, Briles,D.E. and Hollingshead,S.K. 
TITLE The pspC gene encodes a second pneumococcal surface protein 
homologous to the protection-eliciting PspA protein of 
Streptococcus pneumoniae 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 3463) 
MJTHORS Brooks-Walter,A., Tart,R.C., Briles,D.E. and Hollingshead,S.K. 
ITTLE Direct Submission 

JOURNAL Submitted (26-SEP-1996) Microbiology, University of Alabama at 
Burmngham, 19th Street South, Box 10, Birmingham. AL 35294-2170, 
USA 

FEATURES Location/Qualifiers 
source 1..3463 

/organism=" Streptococcus pneumoniae" 

/strain="EF6796" 

/db_xref="taxon: 1313" 
gene 319..3108 

/gene="pspC" 
CDS 31 9.-3 108 

/gene="pspC" 

/codon_start=l 

/transl_table=ll 

/product=" surface protein C" 

/protein_id=" AAD00 184.1" 

/db_xref="PID:g4097980" 

/db_xref="GI:4097980" 

/translation="MFASKSERKVHYSIRKPSIGVASVAVASLFLGGVVHAEGVRSGN 
NLTVTSSGQDISKKYADEVESHLESIXKDVKKNLKKVQHTQNVGLITKLSEIK^ 
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FiG.IID 

Xi: P^yNVLSEAELTSKTKETKEKLTATFEQFKKDTLPTEPEKKVAEAQKKVEEAK 
~KKAE 

DQKEKDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEA 

SKQAEATRLKIQKTOREEAKRKADAKLKEAVEKNVATSEQDKPKRRAKRGVSG 
ELATP 

DKKENDAKSSDSSVGEETLPSPSLNMANESQTEHRKDVDEYIKKMLSEIOLDPvR 
KHTQ 

NVNLNIKLSAIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEK 
KVA 

EAEKKVEEAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKVKEAELELVKEE 
ANES 

RNEEKIKQAKEKVESKKAEATRLEKIKTDRKXAEEEAKRKAEESEKKAAEAKQK 
VDAE 

EYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKEGLRAPLQSKLDTKKAKLSK 
LEE 

LSDKIDELDAEIAKLEVQLKDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADL 
KKAV 

DEPETPAPAPQPAPAPEKPAEKPAPAPEKPAPAPEKPAPAPEKPAPAPEKPAPAPE 
KP 

APTPETPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATG 
WLQNNGSW 

YYLNSNGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDM 
ATGWFQYNG 

SWYYLNANGDMATGWFQYNGSWYYLNANGDMATGWLOYNGSWYYLNSNG 
AMVTGWLQN 

NGSWYYLNANGSMATDWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNG 
SGALAVNT 

TVDSYRVNANGEWVN" 
BASE COUNT 1326 a 591c 755 g 790 1 1 others 
ORIGIN 

1 aagcttatgc ttgtcaataa tcacaaatat gtagatcata tcttgtttag gacagtaaaa 
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61 catcctaatt actttttaaa tattttacct gagttgattg gcttgacctt gttgagtcat 
12,1 gcctatatga cttttgtttt agtttttcca gtttatgcag ttattttgta tcgacgaata 
181 gctgaagagg aaaagttatt acatgaagtt ataatcccaa atggaagcat aaagagataa 
241 atacaaaatt cgatttatat acagttcata ttgaagtgat atagtaaggt taaagaaaaa 
301 atatagaagg aaataaacat gtttgcatca aaaagcgaaa gaaaagtaca ttattcaatt 
361 cgtaaattta gtattggagt agctagtgta gctgttgcca gcttgttctt aggaggagta 
421 gtccatgcag aaggggttag aagtgggaat aacctcacgg ttacatctag tgggcaagat 
481 atatcgaaga agtatgctga tgaagtcgag tcgcatctag aaagtatatt gaaggatgtc 
541 aaaaaaaatt tgaaaaaagt tcaacatacc caaaatgtcg gcttaattac aaagttgagc 
601 gaaattaaaa agaagtattt gtatgactta aaagttaatg ttttatcgga agctgagttg 
661 acgtcaaaaa caaaagaaac aaaagaaaag ttaaccgcaa cttttgagca gtttaaaaaa 
721 gatacattac caacagaacc agaaaaaaag gtagcagaag ctcagaagaa ggttgaagaa 
781 gctaagaaaa aagccgagga tcaaaaagaa aaagatcgcc gtaactaccc aaccattact 
841 tacaaaacgc ttgaacttga aattgctgag tccgatgtgg aagttaaaaa agcggagctt 
901 gaactagtaa aagtgaaagc taaggaatct caagacgagg aaaaaattaa gcaagcagaa 
961 gcggaagttg agagtaaaca agctgaggct acaaggttaa aaaaaatcaa gacagatcgt 
1021 gaagaagcta aacgaaaagc agatgctaag ttgaaggaag ctgttgaaaa gaatgtagcg 
1081 acttcagagc aagataaacc aaagaggcgg gcaaaacgag gagtttctgg agagctagca 
1141 acacctgata aaaaagaaaa tgatgcgaag tcttcagatt ctagcgtagg tgaagaaact 
1201 cttccaagcc catcccttaa tatggcaaat gaaagtcaga cagaacatag gaaagatgtc 
1261 gatgaatata taaaaaaaat gttgagtgag atccaattag atagaagaaa acatacccaa 
1321 aatgtcaact taaacataaa gttgagcgca attaaaacga agtatttgta tgaattaagt 
1381 gttttaaaag agaactcgaa aaaagaagag ttgacgtcaa aaaccaaagc agagttaacc 
1441 gcagcttttg agcagtttaa aaaagataca ttgaaaccag aaaaaaaggt agcagaagct 
1501 gagaagaagg ttgaagaagc taagaaaaaa gccaaggatc aaaaagaaga agatcgccgt 
1561 aactacccaa ccaatactta caaaacgctt gaacttgaaa ttgctgagtc cgatgtgaaa 
1621 gttaaagaag cggagcttga actagtaaaa gaggaagcta acgaatctcg aaacgaggaa 
1681 aaaattaagc aagcaaaaga gaaagttgag agtaaaaaag ctgaggctac aaggttagaa 
1741 aaaatcaaga cagatcgtaa aaaagcagaa gaagaagcta aacgaaaagc agaagaatct 
1801 gagaaaaaag ctgctgaagc caaacaaaaa gtggatgctg aagaatatgc tcttgaagct 
1861 aaaatcgctg agttggaata tgaagttcag agactagaaa aagagctcaa agagattgat 
1921 gagtctgact cagaagatta tcttaaagaa ggcctccgtg ctcctcttca atctaaattg 
1981 gataccaaaa aagctaaact atcaaaactt gaagagttga gtgataagat tgatgagtta 
2041 gacgctgaaa ttgcaaaact tgaagttcaa cttaaagatg ctgaaggaaa caataatgta 
2101 gaagcctact ttaaagaagg tttagagaaa actactgctg agaaaaaagc tgaattagaa 
2161 aaagctgaag ctgaccttaa gaaagcagtt gatgagccag aaactccagc tccggctcct 
2221 caaccagctc cagctccaga aaaaccagct gaaaaaccag ctccagctcc agaaaaacca 
2281 gctccagctc cagaaaaacc agctccagct ccagaaaaac cagctccagc tccagaaaaa 
2341 ccagctccag ctccagaaaa accagctcca actccagaaa ctccaaaaac aggctggaaa 
2401 caagaaaacg gtatgtggta cttctacaat actgatggtt caatggcaac aggctggctc 
2461 caaaacaatg gctcatggta ctacctcaac agcaatggcg ctatggcgac aggatggctc 
2521 caaaacaatg gctcatggta ctacctcaac agcaatggcg ctatggcgac aggatggctc 
2581 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggctc 
2641 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggttc 
2701 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggttc 
2761 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggctc 
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FIG. I IF 

2821 caatacaatg gttcatggta ctacctaaac agcaatggtg ctatggtaac aggatggctc 
2881 caaaacaatg gctcatggta ctacctaaac gctaacggtt caatggcaac agattgggtg 
2941 aaagatggag atacctggta ctatcttgaa gcatcaggtg ctatgaaagc aagccaatgg 
3001 ttcaaagtat cagataaatg gtactatgtc aatggctcag gtgcccttgc agtcaacaca 
3061 actgtagata gctatagagt caatgccaat ggtgaatggg taaactaaac ttaatataac 
3121 tagttaatac tgacttcctg taagaactct ttaaagtatt ccctacaaat accatatcct 
3181 ttcagtagat aatataccct tgtaggaagt ttagattaaa aaataactct gtaatctcta 
3241 gccggattta tagcgctaga gactacggag tttttttgat gaggaaagaa tggcggcatt 
3301 caagagactc tttaagagag ttacgggttt taaactatta agctttctcc aattgcaaga 
3361 gggcttcaat ctctgctagg tgctagcttg cgaaatggct cccacggagt ttggcrgcgc 
3421 cagatgttcc acggaggtag tgaggagcga ggccgcggaa ttc 



the above report in format 
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FIG. 12 
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FIG. 15 A 



LOCUS PSPC.V26 730 AA PROT SYN 

DEFINITION Streptococcus pneumoniae PspC protein from strain V26 

ACCESSION - 

KEYWORDS - 

SOURCE - 

FEATURES Location/Qualifiers 
PROPEP <1..731 

/note="3 to 2349 of V26.pspC (translated)" 

ORIGIN 

1 MFASKSERKV HYSIRKFSIG VASWVASLF LGGWHAEEV RRGNNLTVTS SGDEVESHYQ 
61 SILEKVRKSL EKDRHTQNVD LIKKLQDIKR TYLYNLKEKP EAELTSKTNK ELDAAFEKFK 
121 KEPELTKKLA EAEKKAKDQK EEDHRNYPTN TYKTIELEIA EAEVGVAKAE LELAQAQVQI 
181 PQDTEKINAA KAKVEAAKSN VKKLEKIKSD IEKTYLYKLD NSTKETPKSR VRRNSPQVGD 
241 SRELKET1DK AKETLSTYMV TRLTKLDPSV FWFADLLMDA KKWEEYKTK LEDASDKKSV 
301 EDLRKEAEGK IESLIVTHQN REKENQPAPQ PGGQAGGSMV VPPVTQTPPS TSQSPGQKAT 
361 EAEKKKLQDL IRQFQEALNK LDDETKTVPD GAKLTGEAGK AYNETRTYAK EWDKSKKLL 
421 SQTAVTMDEL AMQLTKLNDA MSKLKEAKAK LVPEVKPQPE NPEPKPQPEG EKPSVPDINQ 
481 EKEKAKLAIA TYMSKILDDI KKHHLKKEKH HQIVALIKDL DKLRKQALSE IDNVNTKVEI 
541 ENTVHKVFAD MDTWTKFQK GLIQNTPQVP EAQRAQRYQR FQIHQKAPDT PQVPEAPKSP 
601 EVPKVPEAPK APDTPQVPEA PKSPEVPKVS DTPKAPDTPQ VPEAPKSPEV PKVPEAPKAP 
661 DTPQVPEAPK SPEVPKVPDT PKAPDTPQVP EAPKAPDTPQ IPEAPAPETP APAPEAPKTG 
721 WKQENGMWKG 

// 
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- - FIG. I6A 

LOCUS E134.DNA 2405 BP SS-DNA SYN 
DEFINITION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 187.2405 
BASE COUNT 1046 A 386 C 505 G 468 T 0 OTHER 
ORIGIN 

1 CGGCCGCCAG TGTGATGGAT ATCTGCAGAA TTCGCCCTTC GACGAATAGC TGAAGAGGAA 
61 AAGCTATTAC ATGAAGTTAT AATCCCAAAT GGAAGCATAA AGAGATAAAT ACAAAATTCG 
121 ATTTATATAC AGTTCATATT GAAGTGATAT AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA 
IaI ATAAACATGT TTGCATCAAA AAGCGAAAGA AAAGTACATT ATTCAATTCG TAAATTTAGT 
241 ATTGGAGTAG CTAGTGTAGT AGTTGCTAGT CTTGTTATGG GAAGTGTGGT TCATGCGACG 
301 GAGAATGAGG GAATTACCCA AGTAGCCACT TCTTATAATA AGGCAAATGA AAGTCAGACA 
lil GAACATAGGA AAGCTGCTAA ACAAGTCGAT GAAGATATAA AAAAAATGTT GAGTGAGATC 
421 CAAGAATATA TAAAAAAAAT GTTGAGTGAG ATCCAATTAG ATAAAAGAAA ACATACCCAA 
S! A ATGT CAACT TAAACAGAAA GTTGAGCGCA ATTCAAACGA AGTATTTGTA TGAATTAAGA 
lil ' ' l AAAAG AGAAGTCGAA AAAAGAAGAG TTGACGTCAA AAACAAAAAA AGAGTTAGAC 
601 GCAGCTTTTG AGAAGTTTAA AAAAGAACCA GAACTTACTA AAAAATTAGC AGAAGCTAAA 
661 CAAAAAGCCA AGGCTCAAAA AGAAGAAGAT TTCCGTAACT ACCCAACCAA TACTTACAAA 
721 ACGCTTGAAC TTGAAATTGC TGAGTTCGAT GTGAAAGTTA AAGAAGCGGA GCTTGAACTA 
781 GTAAAAGAGG AAGCTAAACC CCGAAACGAG GAAAAAATTA AGCAAGCAAA AGCGAAAGTT 
GAGAGTAAAA AAGCTGAGGC TACAAGGTTA GAAGAAATCA AGACAGAACG TAAAAAAGCA 
„2l GAAGAAGAAG CTAAACGAAA AGCAGAAGAA TCTGAGAAAA AAGCTGCTGA AGCCAAACAA 
961 AAAGTGGATA CTAAAGAGCA AGGTAAACCA AAGAGGCGGG CAAAACGAGG 
AGTTTCTGGA 

1021 GAGCTAGCAA CACCTGATAA AAAAGAAAAT GATGCGAAGT CTTCAGATTC TAGCGTAGGT 
1081 GAAGAAACTC TTCCAAGCCC ATCCCTTAAT ATGGCAAATG AAAGTCAGAC AGAACATAGG 
™ A^G^GTCG ATGAATATAT AAAAAAAATG TTGAGTGAGA TCCAATTAGA TAGAAGAAAA 
™ ^TACCCAAA A TGTCAACTT AAACATAAAG TTGAGCGCAA TTAAAACGAA GTATTTGTAT 
I ™3 2 A ^ZX AAGTG TTTTAAAAGA GAACTCGAAA AAAGAAGAGT TGACGTCAAA AACCAAAGCA 
1321 GAGTTAACCG CAGCTTTTGA GCAGTTTAAA AAAGATACAT TGAAACCAGA AAAAAAGGTA 
1381 GCAGAAGCTG AGAAGAAGGT TGAAGAAGCT AAGAAAAAAG CCAAGGATCA 
AAAAGAAGAA 

3213 2 A S GCCGTA ACTAC CCAAC CAATACTTAC AAAACGCTTG AACTTGAAAT TGCTGAGTCC 

™ GATGTGAAAG TTAAAAAAGC GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA CGAATCTCGA 

™ ™™ GAAA AAATTAAGCA AGCAAAAGAG AAaGTTGAGA GTAAAAAAGC TGAGGCTACA 

ill] AAATCAAGAC AGATCGTAAA AAAGCAGAAG AAGAAGCTAA ACGAAAAGCA 

]?51 5 AAGAATCTG AG AAAAAAGC TGCTGAAGCC AAACAAAAAG TGGATGCTGA AGAATATGCT 

™ 2T3H™ CTA ^TCGCTGA GTTGGAATAT GAAGTTCAGA GACTAGAAAA AGAGCTCAAA 

3o21 GAG : 7 GA T G AG TCTGACTC AGAAGATTAT CTTAAAGAAG GCCTCCGTGC TCCTCTTCAA 

™ Z^T^ 7700 A TACCAAAAA AGCTAAACTA TCAAAACTTG AAGAGTTGAG TGATAAGATT 

«2« GA J?^ GTTAG ACGCTGAAA T TGCAAAACTT GAAGTTCAAC TTAAAGATGC TGAAGGAAAC 

1X2 ^^1°™° A^CCTACTT TAAAGAAGGT TTAGAGAAAA CTACTGCTGA GAAAAAAGCT 

™ GA ^T7 AGAAA AACCTGAAGC TGACCTTAAG AAAGCAGTTG ATGAGCCAGA AACTCCAGCT 

* ™ GGCTCCTC ^CCAGCTCC AGCTCCAGAA AAACCAGCTG AAAAACCAGC 
TCCAGCTCCA 

TGAAAAACCA CAGAAA AACCAGCTCC AG CTCCAGAA AAACCAGCTG AAAAACCAGC 

..™V GC 7 GAAGAAC CAG CTGAAAA ACCAGCTCCA GCTCCAGAAA AACCAGCTCC 
AACTCCAGAA 

„ AAACCAGCTC CAACTCCAGA AACTCCAAAA ACAGGCTGGA AACAAGAAAA 

CGGTATGTGG > 
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D?F.N?nON°' 9DNA 2774BPSS - DN A SYN 
ACCESSION - 

KEYWORDS - CTir* t-7A 

source . rMjjfA 

FEATURES Location/Qualifiers 
CDS 36Z.2467 

Ow!l£ OU ? T 105?A 4?6C 615G 624T 2 OTHER 

61 ta^SSaII t??Z? a £ act ataga atact CAAGCTATGC ATCAAGCTTA TGCTTGTCAA 

8 TTT^GTTTTT £cS™£ ™° CTT GTTGAGT CATGCTTATG TGACTTTTGT 

241 ATTACATG AA^T^tTatT?- ^a^TZ- A ZT TT GTATCGACGA ATAGCTGAAG AGGAAAAGCT 
301 TATACAGTTC SISJSilS CAAATGGAAG CATAAAGAGA TAAATACAAA ATTCGATTTA 
36 CATG^SS Tr AY^Z^ TATAGTfiA GGTTAAAGAA AAAATATAGA AGGAAATAAA 
42 A^TaS^AGT rT^S^ 3 ^ 61 ACATTATTCA ATTCGTAAAT TTAGTATTGG 
481 crlrrJALV-r 7£S?7 GTTG CCAGTCTTGT TATGGGAAGT GTGGTTCATG CGACAGAGAA 
S ^^^T^^^M<G^^TC^^r^f^ A ^I^^^^ TAATATGGCA AAGACAGAAC ATAGGAAAGC 
601 AAGAAAACAT Irr 25? ™ ATATAGA AAAAATGTTG AGGGAGATTC AACTAGATAG 
66 TTTGCGTGAA ™ J^ 00 ™^ CATAAAGTTG AGCGCAATTA AAACGAAGTA 

721 It^aaaX™ V^J GTTT TAGA AGAGAA GTCGAAAGAT GAGTTGCCGT CAGAAATAAA 
7? GGTAfYAP^^v'TAAr^^''^' TTGAGAAGTT TAAAAAAGAT ACATTGAAAC CAGGAGAAAA 
S A^A^ ^^^^ ATCAAAAAGA 
901 GTTCGATPTf^ SSKiSIfSS CAACCAATAC TTACAAAACG CTTGAACTTG AAATTGCTGA 
X TxSSS^t^ CTAAAGAATC 
1021 TACA^GG^ SSa^JT* AGCAAGCAAA AGAGAAAGTT GAGAGTAAAA AAGCTGAGGC 

oi aqS^^^5^^ qac ^ tcg taaaa aagca GAAGAAGAAG CTAAACGAAA 
aaccaaaTsg^ AAGTTGAAGG AAGCTAATGT AGCGACTTCA gatcaaggta 

AAAATCA?§? GCAAAA CGAGGAGTTC CTGGAGAGCT AGCAACACCT GATAAAAAAG 

\ a? SSSS^ ^USJSSS. taggtgaaga AACTCTTCCA AGCTCATCCC tgaaatcagg 
ccaagc^atca^ gcaga agctg agaagaaggt tgaagaagct gagaaaaaag 

\ sal tg^S^?^ ?at5.?S gta actac ccaac caatacttac aaaacgcttg accttgaaat 
aggaa g ctaa catgtgaaag ttaaagaagc ggagcttgaa ctagtaaaag 

GTA^JiS^Sc 0070 ^ GACGAGGAAA AAATTAAGCA AGCAAAAGCG AAAGTTGAGA 

i 56? aogaaSSS ^?S a 21^^ tc/vaqac agat cgtaaa aaagcagaag AAGAAGCTAA 

62 GCCG^TACT?^?^^? ATAAAGTTAA AGAAAAACCA GCTGAACAAC CACAACCAGC 
AACAACCAAA CAACCAGAAA AACCAGCTCC AAAACCAGAG AAGCCAGCTG 

80 TCCAAaSS 7?nSS£t C f G0MKCCGCC AAAAACTGAA AAACCAGCAC AACCATCTAC 
86 m?g^ca??7t^ c ^ gamacgg TATGTGGTAC TTCTACAATA CTGATGGTTC 

1 92 tatgScSaca ??5TS22SS AAAACAACGG TTCATGGTAC TATCTAAACG CTAATGGTGC 
98 AATOgSKS rr^lrrnZ^ AAAACAATGG TTCATGGTAC TATCTAAACG CTAATGGTTC 

204 TATgScgISa ??1S2SS5 AAAACAATGG TTCATGGTAC TACCTAAACG CTAATGGTGC 
GCAATGGCGC GGATGGCTCC AATACAATGG TTCATGGTAC TACCTAAACA 

CTAATCStS? CGACA GGATGGCTCC AATACAATGG CTCATGGTAC TACCTCAACG 
2161 TATGGCGACA GGATGGCTCC AAAACAACGG TTCATGGTAC TACCTCAACG 
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FIG.I7B 



CTAATGGTGA 

S™S GACA GGA TGGCTCC AATACAACGG TTCATGGTAT TACCTCAACG CTAATGGTGA 
CATCAGGTGC GGTTGGGTGA AAGATGGANA TACCTGGTAC TATCTTAAAG 

Z™ A ^ A - GCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA ATGGCTCAGG 
GTGAATCGGT A GTCAACACAA CTGTAGATGG CTATGGAGTC AATGCCAATG 

?S1 ^^«I AAACC TAA TATAACT AGTTAATACT GACTTCCTGT AAGAACTTTT TAAAGTATTC 
ocol ??T A ?^ AATA CCAT ATCCTT TCAGTAGATA ATATACCCTT GTAGGAAGTT TAGATTAAAA 
?S ^3^ C A T? TG TAATCT CTAG CCGGATTTAT AGCGCTAGAG ACTACGGAGT I I I I II GATG 
;Snl ™^ AAGAAT GGCGGCATTC AAGAGACTCT TTAAGAGAGT TACGGGTTTT AAACTATTAA 

^ J?', LtJ CCTTCTC CA ATTGCAAGAG GGCTTCAATC TCTGCTAGGG TGCTAGCTTG 

CGAAATGGCT 

2761 CCACGGAGTT TNGC 

// 
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LOCUS BG9163.DNA 2489 BP SS-DNA SYN 
DEFINITION - 
ACCESSION - 

KEYWORDS - CIS* I O A 

SOURCE - /V C7. / OA 

FEATURES Location/Qualifiers 
CDS 221. .2489 

BASE COUNT 1026 A 429 C 541 G 493 T 0 OTHER 

ORIGIN 

J-?5n2I« TAC GACCACTATA GGGCGAATTG GGCCCGACGT CGCATGCTCC CGGCCGCCAT 
a a a™??J? T ATTCGAC GAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC 
III a^I??-^ 60 ATAAA< 3AGAT AAATACAACA ttcgatttat atacagttcc TATTGAAGTG 
253 AAG GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA 

™« ^~^¥£tl A CATTATTCAA TTCGTAAATT TAGTATTGGA GTAGCTAGTG TAGCTGTTGC 
Z°] „^ST^ C "^AGGAGGAG TAGTCCATGC AGAAGGGGTT AGAAGTGGGA ATAACCTCAC 
IV, G 2IT A ^TS T agtgg gcaag ATATATCGAA GAAGTATGCT GATGAAGTCG AGTCGCATCT 
Zl] A . GA f^J ATA TTGAAGGATG TCAAAAAAAA TTTGAAAAAA GTTCAACATA CCCAAAATGT 
ell C , TT AATT ACAAAGTTGA GCGAAATTAA AAAGAAGTAT TTGTATGACT TAAAAGTTAA 
ttl .' ' I ' A TCG GAAG CTGAGT TGACGTCAAA AACAAAAGAA ACAAAAGAAA AGTTAACCGC 
ill ' ' ' GAG CAGTTTAAAA AAGATACATT ACCAACAGAA CCAGAAAAAA AGGTAGCAGA 

~] r^Z, G AAGGTTGAAG AAGCTAAGAA AAAAGCCGAG GATCAAAAAG AAAAAGATCG 
III aT™ tac ccaa ccatta CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT 
III SST^fTI*** aaagcggagc TTGAACTAGT AAAAGTGAAA GCTAAGGAAT CTCAAGACGA 
onl aa^^J 1 " ^QCAAGCAG AAGCGGAAGT TGAGAGTAAA CAAGCTGAGG CTACAAGGTT 
o21 AA ^ A ^ A 7 C AAGACAGATC GTGAAGAAGC TAAACGAAAA GCAGATGCTA AGTTGAAGGA 
GGGCA^CG AAGAATGTAG CGACTTCAGA GCAAGATAAA CCAAAGAGGC 

!!£2 ^TA^ITST GG AGAGCTAG CAACACCTGA TAAAAAAGAA AATGATGCGA AGTCTTCAGA 
i53 VPJ^F^J A GGTGAAG AAA CTCTTCCAAG CCCATCCCTT AATATGGCAA ATGAAAGTCA 

II ? A a AGA ^ CAT AGGA AAGATG TCGATGAATA TATAAAAAAA ATGTTGAGTG AGATCCAATT 
oil AAACATACCC CAAATGTCAA CTTAAACATA AAGTTGAGCG CAATTAAAAC 

<£l Iaaa^ZIT 0 TATGAATTAA GTGTTTTAAA AG AGAACTC G AAAAAAGAAG AGTTGACGTC 

\ III AAA ^ ( r < r AAA QCAGAGTTAA CCGCAGCTTT TGAGCAGTTT AAAAAAGATA CATTGAAACC 

aaA^a a G ^ AAAAAAA GTAGCAGAAG CTGAGAAGAA GGTTGAAGAA GCTAAGAAAA 
AAGCCAAGGA 

HV, I? AAAA ^2 AA GAAGATCGC C GTAACTACCC AACCAATACT TACAAAACGC TTGAACTTGA 
AAG AG GAAGC TCCGATGTGA AAGTTAAAGA AGCGGAGCTT GAACTAGTAA 

1 Itl I AA S AA Z?JJ :GAAACGAGG aaaaaattaa GCAAGCAAAA GAGAAAGTTG AGAGTAAAAA 

III t?5 I S*? ACAAGGTTAG AAAAAATCAA GACAGATCGT AAAAAAGCAG AAGAAGAAGC 
T?i t^p™ gcagaagaat CTGAGAAAAA AGCTGCTGAA GCCAAACAAA aagtggatgc 
mi I GAAGA ^™ GCTCTTGAAG CTAAAATCGC TGAGTTGGAA TATGAAGTTC AGAGACTAGA 
m t^?™^ AAAGAGATTG ATGAGTCTGA CTCAGAAGAT TATCTTAAAG AAGGCCTCCG 
III ^ aS^IaTI caatctaaat tggataccaa AAAAGCTAAA CTATCAAAAC TTGAAGAGTT 

qai ?< AG ^ GA I™ ATTGATGAGT TAGACGCTGA AATTGCAAAA CTTGAAGTTC AACTTAAAGA 
llll Tr-Alf^?^^ CAATAATG TAQAAGCCTA CTTTAAAGAA GGTTTAGAGA AAACTACTGC 
finl IriAirrr^ GCTGAATTAG AAAAAGCTGA AGCTGACCTT AAGAAV\GCAG TTGATGAGCC 
CTGAAAMCC^ GCTCCGGCTC CTCAACCAGC TCCGGCTCCA GAAAAACCAG 

CTCCAGCTCX CAGCT CCAGCTCCAG AAAAACCAGC TCCAGCTCCA GAAAAACCAG 
CAGCTCCAGC^ 0 ^ GCTCCAGCTC CAGAAAAACC AGCTC C AGCT CCAGAAAAAC 

2281 TCCAGAAAAA CCAGCTCCAG CTCCAGAAAA ACCAGCTCCA GCTCCTAAAC CAGAAACTCC 
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F/GJ8B 



2341 AGAAACAGGC TGGAAACAAG AAAACGGTAT GTGGTACTTC TACAATACTG ATGGTTCAAT 
2401 GGCAACAGGC TGGCTCCAAA ACAATGGCTC ATGGTACTAC CTCAACAGCA 
ATGGCGTTAT ' 

2461 GGCGACAGGA TGGTTCCCAA ACAATGGTC 

// 
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FIG. 19 



LOCUS BG8090.DNA 1680 BP SS-DNA SYN 
DEFINITION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 220..1680 
BASE COUNT 714 A 287 C 375 G 304 T 0 OTHER 
ORIGIN - 

1 ATTGTATACG ACTCACTATA GGGCGAATTG GGCCCGACGT CGCATGCTCC CGGCCGCCAT 
61 GGCCGCGGGA TTCGACGAAT AGCTGAAGAG GAAAAGCTAT TACATGAAGT TATAATCCCA 
1 21 AATGGAAGCA TAAAGAGATA AATACAAAAT TCGATTTATA TACAGTTCAT ATTGAAGTGA 
1 81 TATAGTAAGG TTAAAGAAAA AATATAGAAG GAAATAAACA TGTTTGCATC AAAAAACGAA 
241 AGAA AAGTA C ATTATTCAAT TCGTAAATTT AGTATTGGAG TAGCTAGTGT AGCTGTTGCC 
301 AGTC \ \ \ i i A TGGGAAGTGT GGTTCATGCG ACAGAGAAGG AGGTAACTAC CCAAGTAGCC 
361 ACTTCTTTTA ATAAGGCAAA TAAAAGTCAG ACAGAACATA TGAAAGCTGC TAAACAAGTC 
421 GATGAATATA TAACAAAAAA GCTCCAATTA GATAGAAGAA AACATACCCA AAATGTCGGC 
481 TTACTCACAA AGTTGGGCGT AATTAAAACG GAGTATTTGC ATAGATTAAG TGTTTCAAAA 
541 GAGAAGTCGG AAGCTGAGTT GCCGTCAGAA ATAAAAGCAA AGTTAGACGC AGCTTTTGAG 
601 CAGTTTAAAA AAGATACATT ACCAACAGAA CCAGGAAAAA AGGTAGCAGA AGCTGAGAAG 
661 AAGGTTGAAG AAGCTAAGAA AAAAGCCGAG GATCAAAAAG AAGAAGATCG TCGTAACTAC 
721 CCAACCATTA CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT GGAAGTTAAA 
781 AAAGCGGAGC TTGAACTAGT AAAAGAGGAA GCTAAGGGAT CTCGAAACGA GCAAAAAGTT 
841 AACCAAGCAA AAGCGAAAGT TGAGAGTAAA CAAGCTGAGG CTACAAGGTT AAAAAAAATC 
901 AAGACAGATC GTGAACAAGC TGAGACTACA AGGTTAGAAA ACATCAAGAC AGATCGTGAA 
961 AAAGCAGAAG AAGCTAAACG AAAAGCAGAT GCTAAAGAGC AAGATGAATC 
AAAGAGGCGG 

1021 GTAAAAGGAG GAGTTCCGGG AGAGCAAGCA ACACTTGATA AAAAAGAAAA 
TGATGCGAAG 

1081 TCTTCAGATT CTAGCGTAGG TGAAGAAACT CTTCCAAGCC CATCCCTGAA ATCAGGAAAA 
1 141 AAGGTAGCAG AAGCTGAGAA GAAGGTTGCA GAAGCTGAGA AAAAAGCCAA 
GGATCAAAAA 

1201 GAAGAAGATC GCCGTAACTA CCCAACCAAT ACTTACAAAA CGCTTGAACT TGAAATTGQT 
1 26 1 G AGTCCGATG TGAAAGTTAA AGAAGCGGAG CTTGAACTAG TAAAAGAGG A 
AGCTAAGGAA 

1 321 TCTCGAAACG AGGAAAAAGT TAAGCAAGCA AAAGCGGAAG TTGAGAGTAA 
AAAAGCTGAG 

1 381 GCTACAAGGT TAGAAAAAAT CAAGACAGAT CGTAAAAAAG CAGAAGAAGC TAAACGAAAA 
1441 GCAGCAGAAG AAG ATAAAGT TAAAGAAAAA CCAGCTGAAC AACCACAACC 
AGCGCCGGCT 

1 501 CCTCAACCAG AAAAACCAGC TCCAGCTCCA AAACCAGAGA ATCCAGCTGA ACAACCAAAA 
1 561 GCAGAAAAAC CAGCTGATCA ACAAGCTGAA GAAGACTATG CTCGTAGATC AGAAGAAGAA 
1 621 TATAATCGCT TGACTCAACA GCAACCGCCA AAAACTGAAA AACCAGCACA ACCATCTACT 
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LOCUS L81905.DNA 1766 BP SS-DNA SYN 
DEFINITION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES LocationyQualifiers 

CDS 217..1766 
BASE COUNT 741 A 313 C 402 G 309 T 1 OTHER 
ORIGIN 

1 GTATACGACT CACTATAGGG CGAATTGGGC CCGACGTCGC ATGCTCCCGG 
CCGCCATGGC 

61 CGCGGGATTC GACGAATAGC TGAAGAGGAA AAGCTATTAC ATGAAGTTAT AATCCCAAAT 
121 GGAAGCATAA AGAGATAAAT ACAAAATTCG ATTTATATAC AGTTCATATT GAAGTGATAT 
1 81 AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA ATAAACATGT TTGCATCAAA AAGCGAAAGA 
241 AAAGTACATT ATTCAATTCG TAAATTTAGT GTTGGAGTAG CTAGTGTAGT TGTTGCCAGT 
301 CTTGTTATGG GAAGTGTGGT TCATGCGACA GAGAACGAGG GAGCTACCCA 
AGTACCCACT 

361 TCTTCTAATA GGGCAAATGA AAGTCAGGCA GAACAAGGAG AACAACCTAA AAAACTCGAT 
421 TCAGAACGAG ATAAGGCAAG GAAAGAGGTC GAGGAATATG TAAAAAAAAT AGTGGGTGAG 
481 AGCTATGCAA AATCAACTAA AAAGCGACAT ACAATTACTG TAGCTCTAGT TAACGAGTTG 
541 AACAACATTA AGAACGAGTA TTTGAATAAA ATAGTTGAAT CAACCTCAGA AAGCCAACTA 
601 CAGATACTGA TGATGGAGAG TCGATCAAAA GTAGATGAAG CTGTGTCTAA GTTTGAAAAG 
661 GACTCATCTT CTTCGTCAAG TTCAGACTCT TCCACTAAAC CGGAAGCTTC AGATACAGCG 
721 AAGCCAAACA AGCCGACAGA ACCAGGAGAA AAGGTAGCAG AAGCTAAGAA 
GAAGGTTGAA 

781 GAAGCTGAGA AAAAAGCCAA GGATCAAAAA GAAGAAGATC GTCGTAACTA CCCAACCATT 
841 ACTTACAAAA CGCTTGAACT TGAAATTGCT GAGTCCGATG TGGAAGTTAA AAAAGCGGAG 
901 CTTGAACTAG TAAAAGTGAA AGCTAACGAA CCTCGAGACG AGCAAAAAAT TAAGCAAGCA 
961 GAAGCGGAAG TTGAGAGTAA ACAAGCTGAG GCTACAAGGT TAAAAAAAAT CAAGACAGAT 
1021 CGTGAAGAAG CAGAAGAAGA AGCTAAACGA AGAGCAGATG CTAAAG AG C A 
AGGTAAACCA 

1081 AAGGGGCGGG CAAAACGAGG AGTTCCTGGA GAGCTAGCAA CACCTGATAA 
AAAAGAAAAT 

1 141 GATGCGAAGT CTTCAGATTC TAGCGTAGGT GAAGAAACTC TTCCAAGCCC ATCCCTGAAA 
1201 CCAGAAAAAA AGGTAGCAGA AGCTGAGAAG AAGGTTGAAG AAGCTAAGAA 
AAAAGCGGAG 

1261 GATCAAAAAG AAGAAGATCG CCGTAACTAC CCAACCAATA CTTACAAAAC GCTTGAACTT 
1 321 GAAATTGCTG AGTCCGATGT GGAAGTTAAA AAAGCGGAGC TTGAACTAGT AAAAGAGGAA 
1 381 GCTAAGGAAC CTCGAAACGA GGAAAAAGTT AAGCAAGCAA AAGCGGAAGT 
TGAGAGTAAA 

1441 AAAGCTGAGG CTACTAGGTT AGAAAAAATC AAGACAGATC GTAAAAAAGC AGAAGAAGAA 
1 501 GCTAAACGAA AAGCAGCAGA AGAAGATAAA GTTAAAGAAA AACCAGCTGA ACAACCACAA 
1561 CCAGCGCCGG CTCCAAAAGC AGAAAAACCA GCTCCAGCTC CAAAACCAGA 
GAATCCAGCT 

1621 GAACAACCAA AAGCAGAAAA ACCAGCTGAT CAACAAGCTG AAGAAGAGTA TGCTCGTAGA 
1681 TCAGAAGAAG AATATAATCG CTTGACTCTA CAGCAACCGC CAAAAACTGA AAAACCAGCA 
1741 CAACCATCTA CTCCAAAAAC AAANAC 

// 
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LOCUS DBL6A.DNA 1590 BP SS-DNA SYN 
DEFINITION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 127;. 1590 

BASE COUNT 701 A 261 C 339 G 288 T 1 OTHER 
ORIGIN 

1 AAACTATTAC ATGAAGTTAT AATCCCAAAT GGAAGCATAA AGAGATAAAT ACAAAATTCG 
61 ATTTATATAC AGTTCATATT GAAGTGATAT AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA 
1 21 ATAATTATGT TTGCATCYAA AAGCGAAAGA AAAGTACATT ATTCAATTCG TAAATTTAGT 
1 81 ATTGGAGTAG CTAGTGTAGC TGTTGCTAGC TTGTTCTTAG GAGGAGTAGT CCATGCAGAA 
241 GGGGTTAGAA GTGAGAATAC CCCCAAGGTT ACATCTAGTG GGGATGAAGT CGATGAATAT 
301 ATAAAAAAAA TGTTGAGTGA GATCCAATTA GATAAAAGAA AACATACCCA CAATTTCGCC 
361 TTAAACCTAA AGTTGAGCAG AATTAAAACG GAGTATTT3T ATAAATTAAA AGTTAATGTT 
421 TTAGAAGAAA AGTCAAAAGC TGAGTTGACG TCAAAAACAA AAAAAGAGGT AGACGCAGCT 
481 TTTGAGAAGT TTAAAAAAGA TACATTGAAA CTAGGAGAAA AGGTAGCAGA AGCTCAGAAG 
541 AAGGTTGAAG AAGCTAAGAA AAAAGCCAAG GATCAAAAAG AAGAAGATCA CCGTAACTAC 
601 CCAACCAATA CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT GAAAGTTAAA 
661 GAAGCGGAGC TTGAACTATT GAAAGAGGAA GCTAAAACTC GAAACGAGGA CACAATTAAC 
721 CAAGCAAAAG CGAAAGTTAA GAGTGAACAA GCTGAGGCTA CAAGGTTAAA AAAAATCAAG 
781 ACAGATCGTG AACAAGCTGA GGCTACAAGG TTAGAAAACA TCAAGACAGA TCGTGAAAAA 
841 GCAGAAGAAG CTAAACGAAA AGCAGAAGCA GAAGAAGTTA AAGATAAACT AAAGAGGCGG 
901 ACAAAACGAG CAGTTCCTGG AGAGCCAGCA ACACCTGATA AAAAAGAAAA TGATGCGAAG 
961 TCTTCAGATT CTAGCGTAGG TGAAGAAACT CTTCCAAGCC CATCCCTGAA ATCAGGAAAA 
1 021 AAGGTAGCAG AAGCTCAGAA GAAGGTAGCA GAAGCTGAGA AAAAAGCCAA 
GGATCAAAAA 

1081 GAAGAAGATC GCCGTAACTA CCCAACCAAT ACTTACAAAA CGCTTGACCT TGAAATTGCT 
1 141 GAGTCCGATG TGAAAGTTAA AGAAGCGGAG CTTGAACTAG TAAAAGAGGA 
AGCTAAGGAA 

1 201 TCTCGAAACG AGGAAAAAGT TAAGCAAGCA AAAGCGAAAG TTGAGAGTAA AAAAGCTGAG 
1261 GCTACAAGGT TAGAAAAAAT CAAGACAGAT CGTAAAAAAG CAGAAGAAGC TAAACGAAGA 
1321 GCAGCAGAAG AAGATAAAGT TAAAGAAAAA CCAGCTGAAC AACCACAACC 
AGCGCCGGCT 

1381 CCTCAACCAG AAAAACCAAC TGAAGAGCCT GAGAATCCAG CTCCaGCTCC 
AAAAC CTG AG 

1441 AATCCAGCTG AACAACCAAA AGCAGAAAAA CCAGCTGATC AACAAGCTGA AGAAGACTAT 
1 501 GCTCGTAGAT CAGAAGAAGA ATATAATCGC TTGACTCAAC AGCAACCGCC AAAAACTGAA 
1 561 AAACCAGCAC AACCATCTAC TCCAAAAACA 

// 



